[Query]: Adds ability to choose global vs local/focused statistics for FullTextScore by aayush3011 · Pull Request #45686 · Azure/azure-sdk-for-python

aayush3011 · 2026-03-13T17:20:12Z

Description

Why?

Cosmos DB's implementation of FullTextScore computes BM25 statistics (term frequency, inverse document frequency, and document length) across all documents in the container, including all physical and logical partitions.

While this provides a valid and comprehensive representation of statistics for the entire dataset, it introduces challenges for several common use cases:

Multi-tenant scenarios: Tenants often operate in very different domains, which can significantly change the distribution and importance of keywords. Using global statistics leads to distorted relevance rankings for individual tenants.
Large containers with many partitions: Computing statistics across hundreds or thousands of physical partitions can be time-consuming and expensive. Customers may prefer statistics derived from only a subset of partitions to improve performance and reduce RU consumption.

This is the Python SDK port of the .NET SDK PR: Azure/azure-cosmos-dotnet-v3#5582

What?

This PR extends the flexibility of BM25 scoring so that developers can choose between:

Global (default): FullTextScore computes BM25 statistics across all documents in the container, regardless of any partition key filters. This is the existing behavior.
Local: When a query includes a partition key filter, BM25 statistics are computed only over the subset of documents within the specified partition key values. Scores and ranking reflect relevance within that partition-specific slice of data.

How?

A new full_text_score_scope keyword argument is added to query_items():

   items = container.query_items(
       query="SELECT TOP 10 * FROM c WHERE c.tenantId = @tenantId ORDER BY RANK FullTextScore(c.text, 'keywords')",
       parameters=[{"name": "@tenantId", "value": tenant_id}],
       partition_key=tenant_id,
       full_text_score_scope="Local"  # or "Global" (default)
   )

When full_text_score_scope="Local", the hybrid search aggregator uses only the query's target partition ranges (instead of all ranges) when executing the global statistics query. This is a client-side only change, no new HTTP headers are sent to the backend.

All SDK Contribution checklist:

The pull request does not introduce [breaking changes]
CHANGELOG is updated for new features, bug fixes or other significant changes.
I have read the contribution guidelines.

General Guidelines and Best Practices

Title of the pull request is clear and informative.
There are a small number of commits, each of which have an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

Pull request includes test coverage for the included changes.

Copilot

Pull request overview

This PR adds an opt-in way to scope BM25 statistics used by FullTextScore in Cosmos DB hybrid search, allowing callers to choose between global container-wide statistics and “local” statistics limited to the query’s target partition ranges.

Changes:

Added full_text_score_scope kwarg to query_items() (sync + async) with validation and docs.
Updated hybrid search aggregators to scope global statistics queries to either all ranges (Global/default) or target ranges (Local).
Added sync/async test coverage and updated the package changelog.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
sdk/cosmos/azure-cosmos/azure/cosmos/container.py	Adds `full_text_score_scope` kwarg, validates values, documents behavior, and passes the option into query feed options.
sdk/cosmos/azure-cosmos/azure/cosmos/aio/_container.py	Async equivalent of `full_text_score_scope` kwarg support (validation + docs + feed option).
sdk/cosmos/azure-cosmos/azure/cosmos/_execution_context/hybrid_search_aggregator.py	Uses `fullTextScoreScope` option to decide whether global statistics queries target all partition ranges or only query target ranges.
sdk/cosmos/azure-cosmos/azure/cosmos/_execution_context/aio/hybrid_search_aggregator.py	Async equivalent of local vs global partition-range selection for statistics queries.
sdk/cosmos/azure-cosmos/tests/test_query_hybrid_search.py	Adds sync tests for Global vs Local scope behavior.
sdk/cosmos/azure-cosmos/tests/test_query_hybrid_search_async.py	Adds async tests for Global vs Local scope behavior.
sdk/cosmos/azure-cosmos/CHANGELOG.md	Documents the new `full_text_score_scope` parameter in the unreleased section.

You can also share your feedback on Copilot code review. Take the survey.

sdk/cosmos/azure-cosmos/azure/cosmos/container.py

sdk/cosmos/azure-cosmos/azure/cosmos/aio/_container.py

sdk/cosmos/azure-cosmos/tests/test_query_hybrid_search.py

sdk/cosmos/azure-cosmos/tests/test_query_hybrid_search_async.py

simorenoh

Copilot had some comments that may be worthwhile on the tests added - LGTM otherwise

sdk/cosmos/azure-cosmos/tests/test_query_hybrid_search.py

aayush3011 · 2026-03-14T06:03:46Z

/azp run python - cosmos - tests

azure-pipelines · 2026-03-14T06:04:03Z

Azure Pipelines successfully started running 1 pipeline(s).

aayush3011 · 2026-03-16T19:53:57Z

/azp run python - cosmos - tests

azure-pipelines · 2026-03-16T19:54:15Z

Azure Pipelines successfully started running 1 pipeline(s).

tvaron3

LGTM

tvaron3

PR Review — FullTextScore Scope

Clean, well-scoped change. 2 comments (1 recommendation, 1 suggestion).

sdk/cosmos/azure-cosmos/azure/cosmos/container.py

sdk/cosmos/azure-cosmos/tests/test_query_hybrid_search.py

aayush3011 · 2026-03-16T23:52:27Z

/azp run python - cosmos - tests

azure-pipelines · 2026-03-16T23:52:44Z

Azure Pipelines successfully started running 1 pipeline(s).

aayush3011 · 2026-03-17T02:08:15Z

/azp run python - cosmos - tests

azure-pipelines · 2026-03-17T02:08:31Z

Azure Pipelines successfully started running 1 pipeline(s).

Aayush Kataria and others added 4 commits March 13, 2026 09:49

Adding changes for local and global full statistics improvements

e426646

Merge branch 'Azure:main' into users/akataria/fullTextImprovements

3fced98

Updating some code

5cb2e38

Updating changelog

5096d52

aayush3011 requested a review from a team as a code owner March 13, 2026 17:20

Copilot AI review requested due to automatic review settings March 13, 2026 17:20

github-actions bot added the Cosmos label Mar 13, 2026

github-project-automation bot added this to CosmosDB Python Eco-System Mar 13, 2026

Updating changelog

1074c27

Copilot started reviewing on behalf of aayush3011 March 13, 2026 17:21 View session

Copilot AI reviewed Mar 13, 2026

View reviewed changes

simorenoh reviewed Mar 13, 2026

View reviewed changes

sdk/cosmos/azure-cosmos/tests/test_query_hybrid_search.py Outdated Show resolved Hide resolved

Aayush Kataria and others added 2 commits March 13, 2026 16:39

Fixing build issues, and resolving copilot comments

ce0cf10

Merge branch 'main' into users/akataria/fullTextImprovements

393fcdf

Merge branch 'main' into users/akataria/fullTextImprovements

24c3864

tvaron3 approved these changes Mar 16, 2026

View reviewed changes

tvaron3 reviewed Mar 16, 2026

View reviewed changes

sdk/cosmos/azure-cosmos/azure/cosmos/container.py Show resolved Hide resolved

sdk/cosmos/azure-cosmos/tests/test_query_hybrid_search.py Outdated Show resolved Hide resolved

Aayush Kataria and others added 2 commits March 16, 2026 16:51

Resolving comments

18d2a6a

Merge branch 'main' into users/akataria/fullTextImprovements

dcd08a0

Aayush Kataria and others added 2 commits March 16, 2026 19:07

FIxing failing test cases

d1485e1

Merge branch 'main' into users/akataria/fullTextImprovements

453a66a

Conversation

aayush3011 commented Mar 13, 2026

Description

All SDK Contribution checklist:

General Guidelines and Best Practices

Testing Guidelines

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

simorenoh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

aayush3011 commented Mar 14, 2026

Uh oh!

azure-pipelines bot commented Mar 14, 2026

Uh oh!

aayush3011 commented Mar 16, 2026

Uh oh!

azure-pipelines bot commented Mar 16, 2026

Uh oh!

tvaron3 left a comment

Choose a reason for hiding this comment

Uh oh!

tvaron3 left a comment

Choose a reason for hiding this comment

PR Review — FullTextScore Scope

Uh oh!

Uh oh!

Uh oh!

aayush3011 commented Mar 16, 2026

Uh oh!

azure-pipelines bot commented Mar 16, 2026

Uh oh!

aayush3011 commented Mar 17, 2026

Uh oh!

azure-pipelines bot commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants